WCCL: A Morpho-syntactic Feature Toolkit
نویسندگان
چکیده
The paper presents WCCL, a new formalism and toolkit for constructing morpho-syntactic features, a crucial task for many natural language processing algorithms. One existing solution, JOSKIPI, is analysed from two perspectives: features of the formalism as well as software engineering-related issues. Then we propose its successor. A short case study follows, exemplifying the improvement enabled by using rich features expressed with WCCL. The formalism is targeted at Polish, although it seems well suited for any inflectional language.
منابع مشابه
Constraint Based Description of Polish Multiword Expressions
We present an approach to the description of Polish Multi-word Expressions (MWEs) which is based on expressions in the WCCL language of morpho-syntactic constraints instead of grammar rules or transducers. For each MWE its basic morphological form and the base forms of its constituents are specified but also each MWE is assigned to a class on the basis of its syntactic structure. For each class...
متن کاملA Study on Morpho-Syntactic Patterns: A Cohesive Device in Some Persian Live Sport Radio and TV Talks
Morpho-syntactic patterns device encompasses a subcategory of the cohesive devices that assists hearers to have an adequate mental representation for understanding speech. This article investigates the morpho-syntactic patterns employed in some Persian live sport radio and TV programs adapting Dooley and Levinsohn’s theoretical and analytical framework. The research data includes around 30,000 ...
متن کاملComparison of the high-frequency morpho-syntactic structures of cochlear implant children and children with normal hearing aged 4-6 years
Introduction: Children with cochlear implants experience problems at all language domains, and have more problems in morpho-syntactic skills than others domains. Considering the importance of morphology and syntax in developing of communication skills of children, this study compared the use of high-frequency morpho-syntactic structures among 4-6 years old children with cochlear implants and ty...
متن کاملA Morpho-Syntactic Analyzer of Controlled Japanese
The proposed morpho-syntactic analyzer parses controlled Japanese texts such as articles in newspapers, technical magazines and professional journals and public documents that are transcribed wherever applicable by using Joyo Kanji (frequently used Chinese characters). The analyzer parses sentences in controlled Japanese texts into morpho-syntactic units, further dividing them into the content ...
متن کامل